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Abstract — MillimeterWave communications in the 60 GHz 
band are considered one of the key technologies for enabling 
multi-gigabit wireless access. However, the high propagation loss 
in such a band poses major obstacles to the optimal utilization 
of the wireless resources, where the problem of efficient client 
association to access points (APs) is of vital importance. In 
this paper, the client association in 60 GHz wireless access 
networks is investigated. The AP utilization and the quality 
of the rapidly vanishing communication links are the control 
parameters. Because of the tricky non-convex and combinatorial 
nature of the client association optimization problem, a novel 
solution method is developed to guarantee balanced and fair 
resource allocation. A new distributed, lightweight and easy to 
implement association algorithm, based on Lagrangian duality 
theory and subgradient methods, was proposed. It is shown 
that the algorithm is asymptotically optimal, that is, the relative 
duality gap diminishes to zero as the number of clients increases. 
Both theoretical and numerical results evince numerous useful 
properties of the algorithm, such as fast convergence, scalability, 
time efficiency, and fair execution in comparison to existing 
approaches. It is concluded that the proposed solution can be 
applied in the forthcoming 60 GHz wireless access networks. 

Index Terms — 60 GHz Wireless Access Networks, Resource 
Allocation, Association Control 



I. Introduction 

MillimeterWave (mmW) communications have recently at- 
tracted the interest of academia, industry, and standardiza- 
tion bodies, although the technology was invented and used 
many decades ago, especially in the context of military ap- 
plications (TJ, 0. mmW communications utilizes the part 
of the electromagnetic spectrum between 30 and 300 GHz, 
which corresponds to wavelengths from 10 mm to 1 mm 0. 
The continuous development of Complementary Metal Oxide 
Semiconductor technology |0), with low cost and low power 
consumption, has enabled the use of the mmW spectrum for 
wireless communications, including the provisioning of quality 
of service (QoS) sensitive applications. 

Due to the 60 GHz great commercial potential, multi- 
ple industry-led efforts and international organizations have 
emerged for the standardization 0. Examples include 
IEEE 802.15.3 Task Group 3c (TG3c) @, IEEE 802.1 lad 
standardization task group [7|, the WirelessHD (8) consortium, 
the Wireless Gigabit Alliance (WiGig) 0, and many others. 
More than 5 GHz of continuous bandwidth is available in 
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many countries worldwide, which makes 60 GHz systems 
particularly attractive for gigabit wireless applications such 
as gigabyte file transfer, wireless docking stations, wireless 
gigabit ethernet, wireless gaming, uncompressed high defi- 
nition video transmission. Scenarios such as small-cells [10] 
and mobile data offloading ifTTI . which are nowadays strongly 
motivated by the increased end-user connectivity requirements 
and mobile traffic, can be accommodated with the use of 
60 GHz radio access technology. 

The 60 GHz huge bandwidth offers many benefits in terms 
of capacity and flexibility. For example, even with a low spec- 
tral efficiency such as 0.4 bps/Hz, 60 GHz communication sys- 
tems can provide a very high data rate of 1 Gbps, making it an 
ideal candidate to support simple modulation techniques |12|. 
In contrast, ultra-wideband (UWB) systems require roughly 2 
bps/Hz to achieve 1 Gbps, but its actual deployment is limited 
to 400 Mbps at 1 m operating range iTOl . The estimated 
spectral efficiency for IEEE 802.1 In lfl4l communication 
systems is 25 bps/Hz for achieving 1 Gbps, which makes 
their application to high bandwidth services unacceptable in 
terms of cost and simple implementation |12|. Moreover, the 
60 GHz communication systems are less restricted in terms of 
power limits, as opposed to UWB systems 1151 . The regulation 
related to 60 GHz band allows much higher transmission 
power, compared to other existing wireless local area and 
personal systems 0. 

However, there are several challenging aspects and poten- 
tialities that must still be addressed in mmW communications. 
On the negative side, there are severe channel attenuations. 
Higher power levels are necessary to overcome the high 
path loss. The equivalent isotropic radiated power in the 
60 GHz system is approximately 10 times larger than the 
IEEE 802.1 In (H and 30000 times larger than the UWB 
systems. The typical operation range of 60 GHz communi- 
cations is 10 to 20 meters |6j. For example, a path loss of 
2 meters results in attenuation of approximately 74 dB. Thus 
directionality and blockage problems may be frequent. On the 
positive side, the interference levels for 60 GHz are much 
lower compared to the congested 2.4 GHz and 5 GHz bands. 
The compact design of 60 GHz radio allows the use of 
multiple-antenna solutions at the user device. Compared to 
5 GHz systems, the form factor is approximately 140 times 
smaller and thus can be conveniently integrated into consumer 
electronic products. The exploitation of these unique charac- 
teristics is essential for efficient resource allocation algorithms. 

In this paper we address the fundamental resource allocation 
problem of the client association in 60 GHz wireless access 
networks. Such a problem is more challenging in 60 GHz 
band than traditional wireless networks since the wireless 
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channel is unstable in high frequencies and several events can 
violate the efficient operation of the network, such as moving 
obstacles that can block the communication [16|. Specifically, 
we consider the natural situation where each client has to be 
associated to one of the wireless APs. This gives rise to a 
challenging mixed integer linear optimization problem, which 
is combinatorial and non-convex in general and thus hard to 
solve efficiently. Existing methods, such solution approaches 
for the generalized assignment problem in combinatorial op- 
timization ifTTl § 8], cannot be used because our main goal is 
to minimize the maximum AP utilization in the network and 
ensure a fair load distribution, which cannot be modeled by 
such a generalized assignment problem. Nevertheless, based 
on Lagrangian duality theory lf]~8l § 5] and on subgradient 
methods, we develop a new solution approach and derive a 
distributed and iterative algorithm for client association (DAA). 
We show the asymptotic optimality of the proposed solution 
method by an analytical bound on the duality gap. In addition, 
sufficient conditions under which strong duality holds are also 
characterized. The sensitivity of the convergence speed of 
DAA to the variation of the numbers of clients and APs is 
analytically investigated. Numerical simulations illustrate and 
compare DAA to benchmark algorithms, including the existing 
standard association approaches. 

Unlike the client association approaches in more traditional 
access networks fl9l . we take into account the load, the 
channel quality and the special communication characteristics 
of 60 GHz radio channel and we design a dynamic association 
mechanism that ensures balanced and fair load distribution 
among the APs. Global methods 112011 may be employed 
to find the solution of the combinatorial client association 
problem. However, global approaches have the drawbacks of 
1) the prohibitive computational complexity, even in the case 
of problems with few variables and 2) they are inherently 
centralized, which makes their distributed implementation very 
difficult if not impossible. In contrast, our proposed method is 
fast and can be implemented in a distributed manner, which is 
crucial for many applications. Last but not least, we discuss in 
detail the applicability of the association algorithm on top of 
the current standardized medium access control (MAC) mech- 
anisms. The model and the lightweight algorithm proposed 
in this work are general and can be applied with different 
existing MAC mechanisms for 60 GHz access networks, such 
as IEEE 802.15.3c and IEEE 802.1ad. 

The rest of the paper is organized as follows. In § HT1 
we present related approaches in literature. A description of 
the system model and the problem formulation is presented 
in § |ni] In § IIVI we give the general solution approach 
to the client association problem by using duality theory 
and subgradient method. In § [V] we describe the important 
properties of the proposed algorithm. In §[VI] numerical results 
are presented. Lastly, § I VIII concludes the paper. 

II. Related work 

During the last decade, resource allocation and in particular 
association/handoff control for wireless local area networks 
(WLANs) have been the focus of intense research. Several 
studies have analyzed the performance of the basic association 
policy that IEEE 802.1 1 standard defines based on the received 



signal strength indicator (RSSI). These studies have showed 
that this basic association policy can lead to inefficient use 
of the network resources lTT9l . Oil . ll22l . Therefore, there has 
been increasing interest in designing better client association 
policies. In what follows we summarize some representative 
approaches in literature. 

The authors in |22| study a client association policy that en- 
sures network-wide max-min fair bandwidth allocation to the 
users. The work in [23 1 presents self-configuring algorithms 
that provide improved client association and fair resource 
sharing. In |24| the problem of optimal user association to the 
available APs is formulated as a utility maximization problem. 
In |25l . neighbor and non-overlap graphs are utilized to reduce 
the probing latency. Moreover, the "multi-homing" scenario 
is introduced in 11261 . where the traffic is split among the 
available APs. In this approach, the throughput is maximized 
by constructing a fluid model of user population that is multi- 
homed by the available APs in the network. The system 
presented in |19| ensures fairness and quality of service 
(QoS) provisioning in WLANs with multiple APs. The authors 
compare their system to 802.11 and prove that 802.11 cannot 
support both fairness and QoS guarantees. 

Several approaches in literature jointly consider the associ- 
ation and handoff problems in WLANs. The authors of [27 1 
propose an association/handoff mechanism that is based on 
performance monitoring of the wireless links. In [21], there is 
a detailed analysis of the 802.11 association/handoff process. 
The presented approach optimizes the probe phase of the 
association process to reduce the probe latency. In [28 |, the 
authors propose protocols for an architecture called SMesh, 
which supports fast handoff introducing collaboration between 
the APs in a mesh network. The work in [29 1 presents a pre- 
authentication mechanism that speeds up the handoff process. 

Another line of research considers user service requests by 
readjusting the load across all APs 11301 . Therein, a network- 
directed roaming approach suggests to the clients the most ap- 
propriate network location to satisfy their requests. In tl3"Tl and 
l32l . a dual-association approach in wireless mesh networks 
is presented, where the APs for unicast traffic and the APs 
for broadcast traffic are independently chosen by exploiting 
overlapping coverage and optimizing the overall network load. 
Lastly, in 11331 . IT341 dynamic association and reassociation 
procedures are introduced with the use of the notion of the 
airtime cost. This metric reflects the uplink/downlink channel 
conditions and the traffic load in the network. The cross- 
layer extension of this mechanism considers the routing-based 
information from the mesh backbone. Accordingly, the clients 
optimize their association/handoff decision. 

The previous approaches are hard to apply in 60 GHz 
wireless access networks, due to the special characteristics 
of the 60 GHz channel, and the obvious differences with 
the rest wireless access technologies that we have previously 
mentioned. It follows that novel mechanisms must be designed 
to provide optimal resource allocation. These mechanisms 
must take into account the characteristics of 60 GHz wireless 
channel such as increased path loss, short range, fragile links, 
etc. Unfortunately, there is not much research in this field. 

Some recent interesting approaches on 60 GHz wireless per- 
sonal and local area networks have appeared in the literature. 
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The authors in [35 1, study the throughput of mmWave personal 
networks. A combination of carrier sense multiple access 
(CSMA/CA) and time division multiple access (TDMA) is 
used. A technique for optimizing the throughput of the network 
is proposed, where collisions are reduced in the CSMA/CA 
time-slots by a private channel release time. In [36 1, a MAC 
protocol that employs memory and learning to address deaf- 
ness, while exploiting the reduction of interference between 
simultaneous transmissions, is proposed with reactive interfer- 
ence management. The directionality and blockage problems 
of mmW networks are studied in [16|. A cross-layer approach 
is presented, where a single hop transmission is preferred when 
line of sight (LOS) is available and a relay node is randomly 
selected as an alternative. In ||37l , a resource management 
mechanism is proposed based on the exclusive region (ER) 
to exploit the spatial reuse of mmW networks. The authors in 
[38 1 describe an interference analysis framework that enables 
a quantitative evaluation of collision loss probability for a 
mmW mesh network with uncoordinated transmissions, as 
a function of the antenna patterns and spatial density of 
simultaneously transmitting nodes. Concurrent transmissions 
in 60 GHz wireless networks are studied in [39] by exploiting 
the spatial reuse and time division multiplexing gain. It is 
shown that the network throughput is improved compared to 
single hop transmission schemes. 

The current 60 GHz standardization bodies, such as IEEE 
802.1 lad and IEEE 802.15.3c, adopt the RSSI-based mecha- 
nism as the basic association/handoff functionality. However, 
RSSI is not an efficient decision metric for user association for 
several reasons. High RSSI values cannot univocally indicate 
high throughput. This is because RSSI not only depends on 
the distance from the APs, but also on the transmission powers 
of the APs. The accuracy of the RSSI-based technique is 
significantly affected by the high path loss, dispersion and 
directionality of the 60 GHz wireless channel. Moreover, since 
the wireless channel is a shared medium, throughput depends 
on the population of the cell served by the AP. An AP may 
become overloaded if a large number of clients are associated 
with it. Therefore, new metrics that better reflect the channel 
characteristics and the utilization of the AP are required. 

In contrast to the existing work in literature, this paper 
considers the special characteristics of the 60 GHz channel in 
an optimization problem where the objective is to minimize 
the maximum AP utilization in the network. Moreover, we 
design a lightweight distributed algorithm that balances the 
AP utilization by optimizing the client association process. 
We believe that this paper is the first to study such a fun- 
damental resource allocation problem in 60 GHz wireless 
access networks, especially when complex scenarios with 
many users and high traffic demands are considered irTOl , tTTTI . 
We propose a simple yet efficient solution and compare it to 
basic association policies, already in use in the present 60 GHz 
communication technologies under standardization (802.15.3c, 
802.1 lad). The work that is presented and evaluated in the 
forthcoming sections is complementary to the aforementioned 
resource management and scheduling approaches (the clients 
must first be assigned to the available APs and then the 
scheduling of the transmission can be handled under the 
established association state). 




Figure 1, Wireless access network: M = {1,2,3}, M = {1,...,10}, 
Mi = {1, ... , 6}, Mi = {4, . . . , 9}, M 3 = {3, 4, 9, 10}, M 3 = {1} for 
j e {1, 2}, Nj = {2} for j G {7, 8}, Nj = {3} for j = 10, Nj = {1, 2} 
for j € {5, 6}, AO = {1, 3} for j = 3, Afj = {2, 3} for j = 9, Mj = 
{1,2,3} for j = 4. The area inside the solid-lined circles around the APs 
1, 2, and 3 represent the transmission regions of each AP. The demanded data 
rate for client 3 is Q3 , the offered transmission rate from AP 1 to client 3 is 
-R13, and the offered transmission rate from AP 3 to client 3 is -R33. 



III. System model and problem formulation 

A 60 GHz wireless access network consisting of N APs 
and M clients is considered. We denote the set of APs by 
Af = {1, . . . , N} and the set of clients by M = {1, . . . , M}. 
The set of clients that can be associated to AP i is denoted 
by AAi. We assume that there are no isolated clients, i.e., 
MiU ■ ■ - U Mn = M. We denote by Afj the set of candidate 
APs that client j could be associated with. Figure [TJ shows an 
example access network, where the clients positioned inside a 
disc with radius r (centered at the location of AP i) can be 
associated with AP i. However, the disk-shaped region is only 
used for illustrative purposes. 

Each node (AP or client) is equipped with steerable di- 
rectional antennas and it can direct its beams to transmit or 
to receive. We assume that AP i can support all the clients 
in AAi with a separate transmit beam. We consider the case 
where receivers are using single-user detection (i.e., a receiver 
decodes each of its intended signals by treating all other 
interfering signals as noise) and assume that the achievable 
rate from AP i to client j 6 AAi is 



R i j = W\o %2 1 



(1) 



(N + Ij)W 

where W is the system bandwidth, P J3 is the transmission 
power of AP i to client j, Gij is the power gain from 
AP i to client j, Nq is the power spectral density of the 
noise at each receiver, and Ij is the interference spectral 
density at client j. All these assumptions are coherent with the 
literature and existing standards [6 |, [7 |. The power gain dj is 
modeled as in [39|. In particular, we use the Friis transmission 
equation together with the flat-top transmit/recieve antenna 
gain model ll40l . where a fixed gain is considered within the 
beamwidth and zero gain is considered outside the beamwidth 
of the antenna. In addition, we consider Rayleigh small-scale 
fading. Thus we have 



GTj C Gfj c \ 2 a i j 

167T 2 (dy/do)" 



I &Af,j G Mi 



(2) 



where Gj* is the transmit antenna gain from AP i to client j, 
Gf* is the receive antenna gain from AP i to client j, A 
is the wavelength, a,j is the fading coefficient which is an 
exponentially distributed random variables with unit mean to 
models the Rayleigh small-scale fading, d,j is the distance 
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between AP i and client j, do is the far field reference 
distance [41], 77 is the path loss exponent^] and Ij is the 
communication interference at client j. We capitalize on the 
well studied 60 GHz propagation characteristics [40 1, such as 
highly directional transmissions with very narrow beamwidths 
and increased path losses due to the oxygen absorption, in 
order to assume that the communication interference Ij is very 
small and does not affect significantly the achievable commu- 
nication rates in the network]! The achievable communication 
rates given in ([TJi are used to define the AP utilizations as 
described in the sequel. 

We denote by Qj the demanded data rate of client j. The 
channel utilization between AP i and client j is denoted by 
Pij and is given by the ratio of Qj and Rij, i.e., 



(3) 



Intuitively, the channel utilization /3y- gives an indication of 
the communications performance, in terms of the potential 
loading of the communication channel between AP i and 
client j. Thus, the sum of channel utilizations of AP i (or AP i 
utilization) is given by EjeM 4 Pij x ij> where (x l j) J&Mi are 
binary decision variables, which indicate the client association. 
In particular, for all i g Af and j G A4i 



1 if client j is associated to AP i 
otherwise . 



(4) 



AP i utilization is a metric reflecting the load. Our goal 
is to minimize the maximum AP utilization. Specifically, the 
problem can be formally express as 



minimize max J2jeM t PH 



Pij %ij (5a) 
subject to QjXij < Rij, i &Af,j € Mi (5b) 
£<e^ X H = 1. 3 G M (5c) 



€ {0,1}, j eM 7 ieJVj , 



(5d) 



where the variable is (a^ieA/". jeMi - The main problem pa- 
rameters are (l3ij)ieAf,jeMi! (QjjjeM, and {Rij)ieAf,jeMB 
The constraint (l5bl assure that the demand of client j is less 
or equal to the achievable rate from AP i to client j. This 
constraint can usually be satisfied due to the huge available 
bandwidth of mmW channel. The constraint ( T5cl > ensures that 
client j is assigned to only one AP. The constraint (l5dl 
indicates that the decision variables are binary. 

Note that problem (O differentiates from the generalized 
assignment problem ifTTl § 8], since the objective is to ensure 
fairness in the APs load distribution. Therefore, the existing 

l rj e [2,6] in IEEE 802.1 lad networks fl3l . 

2 In particular, a probabilistic analysis of the interference incurred due 
to uncoordinated transmissions is presented in [40 . It is shown that even 
uncoordinated transmission for different transmit-receive pairs leads to small 
collision probabilities and therefore, the links in the network can be considered 
as pseudo-wired. That is, interference can essentially be ignored in MAC or 
higher layers design. Similar assumptions are also supported by using efficient 
channel allocation in the network 1421 and efficient scheduling algorithms that 
supports concurrent transmissions 11391 . 

3 In general, client association affects the interference levels in wireless 
networks. However, when the characteristics of the 60 GHz wireless channel 
are considered (high oxygen absorption, etc) we can argue that the interference 
can be in the order of noise |4()j, which in turn allows us to suppress the 
dependence of interference on the client association and consider it fixed [ 39 ] . 



solution approaches IfTTl § 8, § 10] for the generalized as- 
signment problem, do not apply here. Moreover, the client 
association problem is combinatorial, and we have to rely on 
exponentially complex global methods lf20l to solve it, unless 
new methods are developed. In the sequel, we present one 
such efficient solution approach, which, although strictly non 
optimal, is asymptotically optimal when M grows. 

To conclude the problem formulation and motivate our solu- 
tion methodology, we emphasize that our problem formulation 
captures the general client association problem in wireless ac- 
cess networks. However, our model and the solution approach 
exploit the unique characteristics of 60 GHz wireless access 
networks, including the instability of the wireless channel in 
this frequency and the abrupt performance degradations (due 
to blockage, etc) that would require immediate resource re- 
allocation actions (such as dynamic association control). 

IV. Solution via dual problem 

Without loss of generality, we assume that /3y- < 1 for 
all i g Af and j G .MiQ As a result, the constrain (f5bT > is 
redundant and can be dropped. Thus, the standard equivalent 
epigraph form [fT8l § 4.1.3] of problem (O is 



minimize t 
subject to 

E 



<t, ieAf 



1, j G M 



e{o,i}, jeMtieAfj , 



(6a) 
(6b) 
(6c) 
(6d) 



where the variables are t and x = (:%)i£ J v', jeMi- We 
denote by p* the optimal value of the problem (|6). Note 
that problem (|6jt is a mixed integer linear program (MILP) 
and is feasible. The existing MILP solvers are, of course, 
centralized and are typically based on global branch and 
bound algorithms, where the worst-case complexity grows 
exponentially with the problem sizes fl8l § 1.4.2]. Even small 
problems, with a few tens of variables, can take a very long 
time to be solved. In the sequel, we apply Lagrangian duality 
to problem (O to develop a novel solution method that is 
distributed and fast. 



A. Dual problem 

Let us first form the partial Lagrangian by dualizing the first 
constraints of problem (|6). To do this we introduce multipliers 
A = (\i)ieN for the first set of inequality constraints. Thus, 
the partial Lagrangian is given by 



L(t,x, A) = t+ Y, M PijXij-t 



= t 1- E h + E E ftiA 



i£]\f 



(7) 



wireless channel can provide. In this case, we can collect all the i, j pairs 
for which fyj > 1 and modify the corresponding sets Afj and Mi, without 
affecting the optimal value of the original problem {5). In particular, if fiij > 
1, then we set Mj := Afj \ {i} and Mi := Mi \ {j}. For example, suppose 
that /J12 > 1, and the corresponding sets A/2 = {1, 2} and Mi = {1, 2, 3}. 
Then, we simply modify Afj and Mi as follows: A/2 = {2}, Mi = {1, 3}. 
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where the second equality follows from rearranging the terms 
and from the equivalence of the following two sets jf] 

{(ij) \i G M,j G Mi} = {{n,m) \m € M,n£ 7V m } . 

(8) 

Let g(X) denote the dual function obtained by minimizing the 
partial Lagrangian (|7) with respect to t and x. For notational 
simplicity, let us further denote by X the set of vectors x that 
satisfy the constraints (I6c]i-(l6dli of problem ©. In particular, 
X can be expressed as a Cartesian product of some sets Xj C 
,j EM, i.e., 



X = Xi x X 2 x ■ ■ ■ x Xm , 



where Xj is given by 



X; 



J2 Xij=l, Xij€{Q,l}, i&Afj 



Thus, the dual function is 

g(X) = inf L(t,x,\) 
JX ' ten v ' 

inf E E Pij*- 



xeX j£M ieJV 3 - 

— oo 



E A i = l 

iEAf 

otherwise 



jeM *i^ x . 



E inf E ftrfA 



E A 

ieAA 

otherwise 



1 



E <?, (*) E A* = 1 

jeA4 ieA" 

— oo otherwise , 



(9) 
• (10) 

(11a) 
(lib) 
(11c) 
(lid) 



where the equality (11 lbt follows from that the linear function 
t(l — EieAf Ai) is bounded below only when it is identically 
zero, the equality (11 let follows from (l9l-(Tl0t. and gj (A) 
in (11 ldt is the optimal value of the problem 



(12) 



minimize EiejV 3 Pij^i x ij 
subject to Xj € Aj , 

with the variable Xj. Even though problem ( TTZb is combina- 
torial, it has a closed-form solution given by 

(13) 







i = argmin/3 nj A n 

ng AG- 
otherwise . 



and is computable very fasfl Thus, the Lagrange dual problem 
is given by 

maximize g(X) = J2 eM 9iW ( 14a ) 
subject to EiGA^Ai = l (14b) 
Xi > 0, * g Af , (14c) 

where the variables is A. We denote by d* the optimal value 
of the problem (Tl4] i that will be useful later. Note that the 
Lagrange dual problem (TBI is a convex optimization problem, 
even though the primal problem © is not convex (see [18, 



5 This equivalence can be visualized by using a bipartite graph, where the 
nodes are the elements of two disjoint sets, the set of APs (i.e., AO and the 
set of clients (i.e., A4), and the edges are the potential AP-client associations. 

6 If X = argmin ng jy. n j(Xn + fi n j) is not a singleton, then an arbitrary 
i 6 X is chosen. 



DAA: Distributed algorithm for client association 

1 Initialization: The local channel utilizations, i.e., (0ij )je A4 are given, at every 
AP i. Set subgradient iteration index k — 1. Each AP i broadcasts the initial 
feasible prices to its local clients j £ A4i. 

2 Every client j sets A — and locally determines its association by solving 
problem U2I . Denote by AP ij the AP for which Xij — 1. 

3 Client j (g A4) signals only to AP ij an do not send any signalling to other 
AP i s, where i E A/j \ {ij }. 

4 Every AP i computes Uj by summing ftij over the clients j £ Mi, who had 
signalled in step 3, see \\5\ . 

5 Subgradient iteration: APs communicate and form u'^ by combining each Ui 
and by performing 1161 to compute A^^ 1 '. 

6 Stopping criterion: if the stopping criterion is satisfied, STOP. Otherwise, set 
k — k + 1, each AP i broadcasts the feasible prices A. ^ to its local clients 
j £ Mi and go to step 2. 

§ 5.2]). Let us next focus on the dual problem (Tl4l and its 
solution method, which allows us to find a good feasible 
solution to the original problem (|6j. 



B. Solving the dual problem via projected subgradient method 

The objective function g(X) is, in general, a non-smooth 
(therefore non-differentiable) concave function. A common 
approach to handle such non-differentiable functions is the 
subgradient method 11431 . because gradient based algorithms 
cannot be applied. Therefore, the projected subgradient 
method l43l . 11441 is used to solve the dual problem (fl4l . 

First, we denote by u a subgradient of —g at a feasible A, 
where u = (ui) ie j^. Specifically, 



E 



(15) 



where x*j for all j £ A4 and i G Afj is obtained as the solution 
of problem ( fT2l for all j e M. Thus the projected subgradient 
method is given by 



(16) 



where k is the current iteration index of the subgradient 
method, a k > is the fcth step sizfl and P is Euclidean pro- 



jection onto the unit simplex II = {A E 



gA£ 



A; 



l,Ai>0} 



(see [44, Excercise 2.1.12]). By employing ( TToT l in an iterative 
manner, we can solve the dual problem ( TBI . However, recover- 
ing a primal feasible solution is nontrivial because the original 
problem ((6]l is noncovex. A discussion of these nontrivial 
issues and how to find a good feasible solution is deferred 
to § IV-BI to maintain a cohesive presentation. Let us next 
describe how the computation of the solution of problem (fT~4T > 
is performed in a distributed manner. 

C. Distributed algorithm for client association (DAA) 

Recall that the dual function 5(A) is separable among 
the clients j G Ai, see the objective function J14at of 
problem ( TBI . Therefore, the subgradient components (Tl5i for 
the subgradient method (TToT l can be computed by coordinating 
the problem ( Tf2l for all j E A4. This suggests DAA presented 
at the top of this page. 

The first step initializes DAA. Step 2 represents the op- 
timization performed in a decentralized fashion by each 
client for fixed A. The optimization at each client j is a 
very simple operation and is to find the AP ij, where, 

7 We chose square summable but not summable step size (e.g., = a/k, 
where < a < 00), that guarantees the asymptotic convergence of the 
subgradient method 1431 . 
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ij = argmiriieAQ Pij^i ( see (ED)- Step 3 requires signaling 
between clients and APs. In particular, each client j signals 
only to AP ij. This signalling process can be performed very 
efficiently, e.g., binary signaling. As a result, we have a light 
protocol between clients and APs. In Step 4, each AP i locally 
computes m (see [TBI , which is the summation of over the 
client who signalled the AP. Step 5 requires AP coordination. 
In particular, APs coordinate to perform ([16}, which is the 
projection of a point onto the unit simplex. The result of this 
operation is given by the solution to a convex optimization 
problem, which can be carried out efficiently. Step 6, is the 
stopping criterion for the algorithm. If the stopping criterion is 
satisfied, DAA terminates. Otherwise, the algorithm continues 
in an iterative manner. In practice, a natural stopping criterion 
would be running it for a fixed number of iterations. 

D. Distributed implementation over existing standards 

Let us discuss now how the actual implementation of the 
proposed DAA algorithm could be achieved on top of the 
existing standards, IEEE 802.15.3c and IEEE 802.11ad. The 
distributed algorithm is performed periodically in the system 
to ensure the balanced operation of the network. The period of 
the execution is given by the control messages established by 
the medium access control protocol, as we see in detail below. 

The iterative association algorithm does not have to be ex- 
ecuted every time a client initiates an association or a handoff 
process. We assume that the newcomer client follows the as- 
sociation mechanism that IEEE 802.15.3c and IEEE 802. Had 
define, based on the RSSI. Then, our algorithm is periodically 
executed to correct possible suboptimal client associations 
in the network by reallocating the available resources. As 
mentioned before the distributed nature of the association 
algorithm is crucial, in the direction of offloading the APs 
and make good use of the small computational resources that 
the clients may provide. Both 802.15.3c and 802.1 lad define 
control frames (denoted as beacon frames) that are periodically 
broadcasted by the APs in the network. The APs can utilize 
these frames to trigger the initialization of DAA and carry 
the required information to the clients. The APs inform their 
clients about the initialization of DAA by setting a special bit 
into the beacon frame. Thus, the clients are ready to cooperate 
towards the optimal resource allocation in the network. The 
information required by the algorithm can be carried in the 
control frames or piggy-backed to the data frames that the 
APs send to the clients [42 1 . Moreover, the clients are piggy- 
backing the information in the data frames that they send to 
the APs. Thus, the algorithm is executed in perfect harmony 
with the networking protocols, without interrupting the actual 
network operation (data communication) and without causing 
extra delays. 

V. Algorithm Properties 

In this section, we first show the convergence performance 
for the proposed algorithm. Then we show how to recover 
a good primal feasible solution. Next, we highlight some 
sufficient conditions under which strong duality holds for the 
MILP © followed by a couple of examples. Finally, we show 
analytically the asymptotic optimality of the algorithm, where 



the relative duality gap diminishes to zero as the number of 
clients in the system grows. 

Recall that p* is the optimal value of the original MILP (O 
and d* is the optimal value of the associated dual problem fl4l . 
We refer to p* as the primal optimal value, d* as the dual 
optimal value, (p* — d*) as the optimal duality gap, and (p* — 
d*)/p* as the optimal relative duality gap, which are useful 
in the rest of the paper. 

A. Convergence 

DAA essentially solves the dual problem (fT~4-b by using the 
projected subgradients method and the convergence of the 
algorithm is established by the following proposition: 

(k) 

Proposition 1: Let gl' t denote the best dual objective 

( k) 

value found after k subgradient iterations, i.e., finest = 
max{ ff (A (1) ), . . . , 5 (A (fc) )}. Then, Ve > 3n > 1 such that 
Vkk>n^(d*-gW t )<e. 

Proof: The proof is built on the material presented in [43. 
§ 3.2] |44l . Let us denote by A* the optimal solution of dual 
problem (fl4l l. Thus, we have 

|| A (fe+D _ A *||2 = || P ( A « _ afcU W) _ A*l|| (17a) 

<||(A«-a fe u«)-A*||2 (17b) 

= ||A (fc) -A*||2-2a fe u( fe ) T (A (fe) - A*) 

+ a 2 fc ||u«||i (17c) 
<\\\^-y\\i-2a k (g(\*)-g(\ {k) )) 

+ ai\\uW\\l (17d) 
= \\\^~\*\\ 2 2 -2a k (d*-g(\^)) 

+ al||uW||I, (17e) 

where (1 1 Tat follows from ([TBI , (I17bb follows from that the 
projection onto unit simplex II always decrease the distance 
of a point to every point in II and in particular to the optimal 
point A*, d 17db follows from the definition of subgradient, 
i.e., -g{\*) > -g{\ (k} ) + uW T (A* - A (fe) ), and (fTTel 
follows from that d* = g (A*). Recursively applying d 17et 
and rearranging the terms, we get 

2 £ ai (d* - g(\®)) = -||A( fc+1 ) - A*||l + ||A« - A*||I 
i=i 

+ £a?||u<*>||2 (18a) 
i=i 

<R 2 + G 2 J2 k =i af , (18b) 

where ( fl8bi > follows from that ||A (fe+1) - A*|| 2 > 0, ||A - 
A*||2 < R = \/2 for any A e II, and the norm of any 
subgradient u of — g at any A € II is bounded, i.e.0 

IM| a <g = v^a^E^A,) 2 - d9) 

Moreover, clearly we have 

d*-g ( £ st <d*-g(\^), l = l,...,k. (20) 

Thus, from ( I18bb . (1201 . and by noting that step size ai = 
a/l, < a < oo is square summable (i.e., = 

8 Compare to (BJ. 
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a 2 7r/6), we obtain an upper bound 



on d~ 



(fc) 

5bcst 



as 



{k) < ( J R 2 + G 2 Et 1 «?)/(2Et 1 «/) (21a) 



< (R 2 /2 + a 2 G 2 7r/12)/eti «i) • (21b) 



Since E/=i a i 15 strictly monotonically increasing in k (it 
grows without bound as k —> oo), for any e > we can 
always find a integer n > 1 such that Ez=i cti > e (R 2 /2 + 
a 2 G 2 7r/12) if k > n, which concludes the proof. ■ 
The bound derived in (I21bl i allows us to predict some key 
behaviors of the convergence of the proposed algorithm. To 
see this, we note from (fT~9b that the numerator of the bound 
depends on (/3y)j g ^ i jeMi for fixed a. Now suppose that 
the number of clients increases. This forces G to increase as 
well. As a result, the corresponding total iterations to reach 
the given accuracy e will also grow. Roughly speaking, this 
means that, for fixed number of APs, the larger the number of 
clients is, the larger the total number of iterations required for 
the convergence of DAA. On the other hand, suppose that the 
user distribution is such that J^jeM Pij I s roughly the same 
for each AP i. Thus, if the total number of APs is increased, 
then G will become larger and as a result, the total number of 
iterations to convergence is increased as well. These algorithm 
behaviors are numerically illustrated in § IVII 

B. Recovering a feasible primal point 

As we discussed in § I V-Al we can solve the dual problem to 
any given accuracy to yield the dual optimal value d* and the 
dual optimal solution A*. If the primal problem is convex, from 
d* and A*, we can usually obtain the primal optimal value p* 
and primal optimal solution (i*,x*) lfl8l § 5.5.5]. However, 
recall that the original MILP © is a nonconvex problem. 
Therefore, unlike convex problems, there is no guarantee that 
from d* and A*, we obtain p* and (£*,x*)0 Nevertheless, in 
the case of MILP ©, a primal feasible point is obtained during 
each iteration k of the algorithm (see step 2 of the algorithm). 
Thus, it is natural to go for the best choice, among all the 
primal feasible points obtained so far. For example, a good 
approximation for the primal optimal value would be 

^^{fW,...,^}, (22) 

where (i( fe ),x( fc )) is the primal feasible point in iteration fcF°l 

A good feasible point would be (tl^L, x wt)' which is the 

(fcl 

primal feasible point that corresponds to W, est . 

(k) 

Even though the value p h( ! st is not usually as good as the 
primal optimal value p*, monte Carlo simulations show that it 
is a good approximate value with k on the order of hundreds 
or more, e.g., k > 100 (see § IVTb . There is no clear analytical 
explanations of these fortuitous encounters, especially because 
the original problem © is nonconvex 11441 § 6.3]. 

C. Duality gap 

The duality gap (p* — d*) is one of the important metric 
that can be used to quantify the performance of the proposed 

'The first component t* of the primal optimal solution of MILP {6} and 
the primal optimal value p* are clearly the same. 

10 APs can compute (t^ k \x^) easily by using the client signaling they 
received at step 2 and the AP coordination at step 5. For example, = 
ll-ii( fc )ll._. 



DAA. Note that, in general, the duality gap for MILP © is 
not zero, because the problem is nonconvex. Therefore, it is 
not surprising that deriving general conditions under which the 
strong duality for MILP © is very difficult. Nevertheless, we 
first provide some examples to highlight sufficient conditions 
for strong duality for MILP ©. The latter part of this section 
derives an analytical bound on the duality gap. Moreover, 
we show the asymptotic optimality of the algorithm, where 
the relative duality gap (p* — d*)/p* diminishes to zero as 
the number of clients in the system grows. Such asymptotic 
results are indeed important from a theoretical, as well as 
from a practical perspective, see for example the duality results 
associated with the well known Knapsack problem |44|. 

The following proposition establishes a simple result, which 
is instrumental to study zero duality. 

Proposition 2: Let p* c i ax denote the optimal value of the 
linear programming (LP) relaxation of problem ©, i.e., 

minimize t (23a) 

subject to 2~2jeMi <% X H - *> * e ^ ( 23b ) 

Ei6A/i ^ = X > 3 € M < 23C ) 

< Xij < 1, j G M, i S Nj , (23d) 

with variables t and x = {xij)i £ j^ : jeMi- Then d* = _p* i ax - 

Proof: Note that problem © is always feasible. The proof 
is based on two key results: 

(a) for problem (l23l . we always have strong duality, i.e., 

Prclax = d rcla X < and 

(b) the dual of problem (l23l is identical to problem (fT4l . 
and therefore <i* c i ax = d*, where d* clax denotes the dual 
optimal value of problem ( f23l . 

In particular, (a) is guaranteed from strong duality results for 
linear programs, see ifTS] § 5.2.3]. To prove (b), we first note 
the following: the partial Lagrangian obtained by dualizing 
first constraint of problem (l23l is identical to (|7), where we 
consider same notations for the dual variables. This is a slight 
abuse of notation, but it helps in the clarity of the exposition. 
Let h(\) denote the dual function obtained by minimizing the 
partial Lagrangian as given by [compare to ([Til l 



, . [ E MA) EieJV A * = 
h(X) = < j&M 

— oo otherwise 



1 



(24) 



where hj(X) is the optimal solution of a problem very similar 
to (fT2l , except that the constraint x 3 € Xj is replaced by 
Xj £ conv(Aj). Note that com{Xj) £ K nj is a unit simplex, 
and therefore the optimal value hj(X) is attained at one 
of the vertexes Xj = (xyJieM °f the unit simplex [45 
Corollary 32.3.4]. Specifically, the components of the vertex 
Xj is identically given by ( fT3l >. As a result, hj(X) — gj(X) 
for all jeM and h(X) = g(X). ■ 
Note that if problem d23l , the LP relaxation of problem ©, 
has integer solutions, then we can easily show that the optimal 
value p* of the original MILP © is equal to the optimal value 
Therefore, from Proposition [2] we 



Prcla 



i.e. 



P = Prcla 



have p* = d*. In other words, if problem (1231 has integer 
solutions, then strong duality holds for the original MILP ©. 
Thus, it is natural to investigate the conditions, under which 
the LP ( f23l has integer solutions. Roughly speaking, there 
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Figure 2. Examples of network topology 1 (a) The initial communication 
graph; (b) The optimal association 

are not many results that establish conditions on LPs, beyond 
total unimodularity [46, § 9] of the associated problem matri- 
ces, under which they have integer solutions. Unfortunately, 
particularized to LP (1231 . the related matrices are not total 
unimodular, and therefore the theoretical implications of total 
unimodularity does not applies ll46l § 9]. Nevertheless, Propo- 
sition fallows us to imagine special cases of MILP (O, where 
we have strong duality. We now present two such examples. 



Example 1: Suppose that the initi al clients and APs commu- 
nication is as shown in Figure |2(a)| In particular, there are M 
clients and N = M APs. Each client j e {2, 3, ... , M} can 
be associated to either AP j — 1 or j. However, the first client 
can be associated only to the AP 1. Moreover, suppose that 
Pii = P £ (0; 1 ] f° r a11 3 6 and the remaining 

PijS can be arbitrary values from the range (0, 1]. 

We can easily show that the optimal client association (i.e., 
the optimal solution of MILP (6]l) corresponds to Figure |2(b)| 
Moreover, we have p* — f3. Let us now focus to the LP relax- 
ation l !23t applied to the network in Figure [2(a)! In this case, every 
client j, except the client 1, can be associated to both AP j — 1 
and AP j. However, we can prove by contradiction: the solution 
of pro blem l!23t corresponds to the optimal association depicted in 
Figure [2(b)] Thus, we have p* = p* e i ax . From Proposition^ we 
have strong duality for MILP (i.e., p* = d* ) and the proposed 
algorithm achieves d* . 



Example 2: Consider the network shown in Figure |3(a)| There 
are N APs and two types of clients connected to APs. The Type 1 
clients can communicate only with a single AP. For example each 
client j £ {1, . . . , Ji} can communicate only with AP 1, and 
therefore, they must be assigned to AP 1. On the other hand, 
the Type 2 clients can communicate with all the APs, e.g, clients 
Jjv + 1, • • • , M. As a result, Type 2 clients can be associated to any 
AP. Now suppose that j3ij values associated with Type 1 clients are 
such that E/ix Pu = ££ Jl+ i to • • • = E/f j N _ 1+1 fai = 
B 6 1R+ and /3y values associated with Type 2 clients are all 
equal to /3 6 (0, 1]. Moreover, suppose that the number of Type 2 
clients is a multiple of N, i.e., M — Jn = rnN for some m G 7L+. 

The optimal client association (i.e. , the optimal solution of 
MILP (6]l) corresponds to Figure |3(b)| where the Type 2 clients 
are equally distributed among the APs. In particular, each AP is 
associated with m Type 2 clients and we have p* — B + m/3. Let 
us now consider solution given by the LP relaxation (f23j. note that 
there is no choice for Type 1 clients, other than associating them 
to the only AP they can communicate. From the symmetry, we 
can easily see that associating each Type 2 client j among all the 
APs i = 1, . . . , N with equal shares is a particular solution to the 
LP relaxation ((23}, i.e., for every Type 2 client j, Xij = (1/N), 
i = 1, . . . , N. Thus, at each AP i, the utilization corresponds 
to Type 1 clients is B and the utilization corresponds to Type 2 
clients becomes (M — Jn)(1/N), which is identical to m (recall 
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Figure 3. Examples of network topology 2 (a) The initial communication 
graph; (b) The optimal association 




(a) (b) (c) 

Figure 4. Example simulation topologies (a) 2APs; (b) 3APs; (c) 5APs 

M — Jn = mN). Therefore, we have p* c i ax — (B + m) = p* 
and strong duality holds for MILP ((6}. 

The examples above give insights into the algorithm's 
behavior in spacial cases. However, they can be used to build 
intuitive ideas of general networks. It is indeed important to 
analyze the proposed algorithm's behavior in general as well. 
In the sequel, we provide theoretical substantiation that allows 
us to predict the general algorithm properties in terms of the 
optimal duality gap and the relative duality gap. 

The following theorem formally establishes a bound on the 
duality gap and the asymptotic optimality of DAA. 

Theorem 1: The optimal duality gap of mixed integer linear 
program © is bounded as follows: 



p* - d* < (N + 1)(q + max Qj) , 
jeM 



(25) 



where q = ma,x ieJK fj eMi fcj and Qj = min ieA /v fy. More- 
over, the relative duality gap diminishes to as M oc. 

Proof: See Appendix lAl ■ 
The theorem above suggests that the duality gap is always 
bounded by a constant that does not depend on the number 
of clients in the system. Note that the bound grows like N, 
and therefore we can expect an increase in the duality gap for 
larger N. The theorem states that for larger M, the relative 
duality gap become almost zero. These are very important to 
get an insight of the behavior of the proposed algorithm in 
general networks, see § |Vl]for numerical examples. 

VI. Numerical Examples 

In this section we present the numerical evaluation of the 
proposed algorithm in a multi-user multi-cell environment. We 
compare DAA to: a) random association policy, b) RSSI-based 
policy, which is the association mechanism used in standards, 
and c) optimal solution of the optimization problem © using 
IBM CPLEX optimizer B71 . 

We define the SNR operating point at a distance d [distance 
units] form any AP as 

P \ 2 /(mn 2 N W) d<d a 
P \ 2 /(16ir 2 N W) ■ {d/d^y 11 otherwise . 



SNR(d) = 
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Figure 5. Influence of the number of clients on the convergence (a) Average 
objective value p( fc ' vs. iterations k, 5 APs, 100 clients; (b) Average objective 
value pW vs. iterations k, 5 APs, 200 clients 
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Figure 6. Influence of the number of APs on the convergence, (a) Average 
objective value pW vs. iterations k, 3 APs, 30 clients; (b) Average objective 
value p( fc ' vs. iterations k, 10 APs, 100 clients 



Circular cells as depicted in Figure Q~]are considered, where the 
radius of each cell r is chosen such that SNR(r) = 10 dB. The 
APs are located such that the distance between any consecutive 
APs is D = l.lr. For example, Figure |4(a)1 shows the case 
for TV = 2 APs, Figure |4(b)| shows the case for N = 3, 
and Figure |4(c)| shows the case for N = 5. The clients are 
uniformly distributed among the circular cells and the potential 
AP-client association (i.e., Aii and Afj) is found as pointed 
out in Figure Q] 

We set A = 5 mm, N a = -134dBm/MHz, W = 1200 MHz, 
and do = lm, see (Q]i and Moreover, for all j g A4, we set 
L = and for all i G TV, j e Mi 



and Gj* 



we set Pij = Pq = O.lmW 



1. In order to check the average perfor- 
mance of the algorithms, we consider T = 1000 time slots, 
where the fading coefficients ay for all i € TV, j £ A4i and 
the demanded data rates Qj for all j £ A4 are constant during 
each time slot T E {1, ...,T} and independently change 
from one slot to another. In particular, the exponential random 
variables with unit mean are independent and identically 
distributed over the time slots. Moreover, we assume that Qj 
are uniformly distributed on [0, 400]Mbits/s and independent 
and identically distributed over the time slots. 

To see the average convergence behavior of the proposed 
algorithm, we consider the average primal objective value of 
problem (|6]l obtained by DAA. In particular, the average primal 
objective value from DAA after k subgradient iterations is 
defined as - (1/T) E^i^CO. where PbL( T ) is 



the best primal feasible objective value of problem (|6]l after 
k iterations at time slot T, see (1221^ The average objective 
values from benchmark algorithms, random association policy, 
RSSI policy, and the optimal policy are defined in a similar 
manner, and are denoted by P r and> Prssi* an d p *> respec- 
tively0 Moreover, the average dual optimal value obtained by 

DAA is D* = (1/T) J2t=i d *( T X where d *( T ) is the optimal 
objective value of dual problem ( TBi i at time slot T. 

Figure [5] shows p( fc ' versus subgradient iteration k, for the 
cases where N — 5, M = 100 (Figure |5(a)| i and N = 5, 
M = 200 (Figure [5 (b)| i. Results show that there is a noticeable 
effect of varying M (the number of clients in the system) on 
convergence time. In particular, the convergence is faster for 
smaller M. This observation is consistent with our analytical 
study presented in § IV-AI Proposed DAA clearly outperforms 
the RSSI policy used in 802.11, 802.15.3c and 802.1 lad, 
as well as the random policy. For example, DAA, yields a 
performance improvement of about 20% compared to RSSI 
policy in both considered cases. The gap between P* and p( k \ 
even after relatively larger the number of subgradient iterations 
(e.g., k = 300) is indeed expected due to the nonconvexity of 
the original problem (|5), see § IV-BI Nevertheless, average dual 

"Due to the nonconvexity of the original problem {5), p\^ st (T) is not 
necessarily achieves the optimal value even when k — > oo. 

12 Since the benchmark algorithms do not depend on subgradient iteration k, 
like p( fe ', there is no superscript (k) required for P ran d, Prssi, and P*. 
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optimal value D* from DAA is almost equal to the optimal P*. 

Figure [6] shows p( fc ' versus subgradient iteration k, for the 
cases where N = 3, M = 30 (Figure [6(a)) and N = 10, M = 
100 (Figure |6(b)| i. Here, the clients density or the number of 
clients per AP is roughly the same (i.e., 10). Results show 
that there is a clear effect of varying N (while keeping client 
density fixed), on convergence. In particular, the convergence 
is faster for smaller N. This observation is inline with our 
analytical study presented in § IV-AI The performance of other 
benchmark algorithms are very similar to those in Figure 

Figure Q shows the average objective from DAA after 
K = 1000 subgradient iterations, versus the number of 
clients M for the cases N = 2 (Figure |7(a)| i and N = 10 
(Figure |7(b)| i. Plots for the benchmark algorithms are also 
depicted. Results show that the average objective values as- 
sociated with each algorithm increase as M increases. This is 
intuitively expected because APs become more loaded as the 
number of clients grows. Results further show that p( K *> that 
came from DAA is very close to the optimal P* in both cases 
and the performance gap is not sensitive to changes in M. Note 
that the performance of the random and the RSSI policies are 
substantially low and their performance degradation becomes 
even noticeable for larger M, see Figure |7(b)| As expected 
D* provides a global lower bound on the performance, and is 
hardly distinguishable from P*. 

Figure [8] shows the average objective value versus the 
number of APs N for the cases where M = 40 (Figure |8(a)| i 



and M = 100 (Figure |8(b)| i. The performance ranking of 
the algorithms is very similar to Figure [7] Results show that 
the average objective values decrease as M increases. This 
is intuitively explained by noting that, the larger the N is, 
the smaller the client density is, and therefore the smaller the 
average objective values of each algorithm becomes. Results 
again shows that DAA performs close to the optimal and 
outperforms the random and RSSI policies noticeably. 

To see the effect of the increasing number of clients on 
the relative duality gap (see § IV-CK we define the met- 
ric average relative duality gap, Ave-RDG. In particular, 

Ave-RDG = (1/T)Et=iO*( T ) - d *i T ))/P*( T l where 
p*(T) is the optimal value of primal problem © and d*(T) 
is the optimal value of dual problem (fl4l >, at time slot T. 
Moreover, we denote by Ave-RDG-best-achieved, a 
related metric defined very similar to Ave-RDG, except that 
p*{T) is replaced with Pu^ t (T), i.e., the best primal feasible 
objective value achieved from DAA after K iterations at time 
slot T. 

Figure [9(a)| measures the percentage Ave-RDG versus M 
for different TVs. For all considered Ave-RDG, N approaches 
to zero as M increases. This is consistent with our analytical 
results established by Theorem\J\ see 5 IV-CI Figure [9(b)| shows 
that the plots of percentage Ave-RDG-best-achieved 
versus M are very similar to those in Figure |9(a)| This 
behavior is not surprising, because the performance of DAA 
is very close to the performance of the optimal method, see 
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Figure 9. (a) Average relative duality gap Ave-RDG vs. the num- 
ber of clients M; (b) Average best achieved relative duality gap 
Ave-RDG-best-achieved vs. the number of clients M 
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Figure 10. (a) Average optimal duality gap Ave-DG vs. the number of clients 
M; (b) Best achieved average duality gap Ave-DG-best-achieved vs. 
the number of clients M 



Figure Q and Figure [8] 

Figure [10] depicts the dependence of the average duality gap 
on M. In particular, we define the optimal average duality 
gap as P* — D* and we plot Ave-DG versus M are shown in 
Figure |10(a)| Moreover, we define the best achieved average 
duality gap Ave-DG-best-achieved as p( A ) — d*. The 
corresponding plots are shown in Figures |10(b)| In both cases, 
results show that there is no apparent effect of the varying M 
on the duality gap. Nevertheless, as we discussed in § IV-CI 
(see d25ll), the average duality gap grows when N increases. 

To examine the fairness of the final client association 
among the APs, we consider the well known Jain's fairness 
index |48l as the fairness metric. We denote by j( fc )(T) 
the fairness level resulted by the proposed DAA at time slot 
T after k iterations. In particular, we define jW(T) = 
{^eMY^iT)) 2 /{NY, t ^Yt\T)% where Y?\T) = 

J2jeMi Pij x i? ( T ) with x fj ] ( T ) bein S the best feasible solu- 
tion resulted from DAA at time slot T and after k iterations. 
The average fairness index JW resulted from DAA after k 
iterations is simply defined as = (1/T)X)t=i J [k) {T). 
The average fairness indexes resulted from the benchmark 
algorithms, the random association policy, the RSSI policy, 
and the optimal policy are defined in a similar manner, and 
are denoted by J ran d, Jrssi, and J*, respectively^ 

''Benchmark algorithms do not depend on subgradient iteration k. There- 
fore, like j( k \ there is no superscript (fc) required for J ra nd, Jrssi, and J*. 



Figure fTTJ depicts jW versus k compared to the benchmark 
fairness indexes J ra nd, Jrssi, and J* for the case where 
N = 5 and M = 100. Note that the fairness index ranges 
from 1 /N (worst performance) to 1 (best performance). As ex- 
pected, the optimal association provides the best performance. 
Results show that within a few hundreds of iterations, DAA 
achieves a fairness level very close to the optimal. Results 
further show that DAA significantly outperforms the random 
and the RSSI policies. 

In order to provide a statistical description of the speed of 
the proposed algorithm, we consider empirical the cumulative 
distribution function (CDF) plots. Specifically, for each time 
slot T € {1, . . . ,T}, we store the total CPU time required for 
DAA to find Pu^l(T). For comparison, we use the total CPU 
time required to find the optimal value p*(T). Figure [T2(a)| 
shows the empirical CDF plots of the number of iterations 
for M = 100, 200, 300, with N = 10. In the case of DAA, 
the effect of changing the problem size by increasing M 
on the CDF plots are almost indistinguishable. However, in 
the case of optimal method, there is a prominent increase 
in the time required to compute p*(T). Figure [T2(b)| depicts 
the average time required by DAA and the optimal method 
versus M. Results show that the average time required by DAA 
to find possibly a suboptimal solution is not sensitive to the 
variation of M, and is almost zero. However, the average time 
required by the optimal method to find the optimal solution 
grows approximately exponentially with M, This is certainly 
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Figure 11. Average fairness index vs. the iterations k, 5 APs, 100 clients 

expected because problem © is combinatorial, and therefore 
the worst-case complexity of the global method (CPLEX) 
grows exponentially with the problem size lfl8l § 1.4.2]. Thus, 
there is naturally a tradeoff between the optimality and the 
efficiency of the algorithms. Nevertheless, Figures 171 l8l and [T2l 
and the asymptotic results in Figure |9]indicate that DAA yields 
a good tradeoff between the optimality and the efficiency, 
which are favorable for practical implementation. 

VII. Conclusions 

In this paper we considered the problem of optimizing 
the allocation of the clients to the available APs in 60 
GHz wireless access networks. The objective in our problem 
formulation was to minimize the maximum AP utilization in 
the network. The optimization problem was combinatorial. 
Thus, we proposed a distributed association algorithm (DAA), 
based on Langragian duality theory and subgradient meth- 
ods. DAA is fully compliant with the existing WiFi and 60 
GHz protocols/standards and it can be easily implemented 
on top of the MAC mechanisms that they define. We studied 
the behavior of DAA through theoretical analysis, where we 
proved its asymptotic optimality properties. Moreover, we 
presented a numerical analysis, where DAA was compared 
to other association policies in realistic scenarios. We tested 
convergence, scalability, time efficiency, and fairness. Our 
results indicate that the proposed solution could be well 
applied in the forthcoming 60 GHz wireless access networks. 

Appendix A 
Proof of TheoremQ] 

The proof is based on a proposition from [49], which we 
restate here for clarity and for simplifying the presentation. 

Proposition 3: Consider the following possibly nonconvex 
problem: 

minimize J2jeJ fj&i) ( 26a ) 
subject to yj e yj ■ , j e J (26b) 
£ i6i7 hi(yi)<b, ( 26c > 

where the variables are y 3 £ H Vj . The problem parameters 
J = {1, . . . , J}, b is a given vector in IRP, is a subset 
of RW, and f 3 : conv(^) -> H and hj : conv(^) -> R Q 
are functions defined on the convex hull of 3^ . The following 
assumptions hold for the primal problem 



0.9- 
0.8 
0.7 
0.6 

i 0.5 

i 

0.4 
0.3 
0.2 
0.1 
0, 







: _ 














































DAA, M=1Q0 

optimal (CPLEX), M=100 

DAA, M=2Q0 

optimal (CPLEX), M=200 










DAA, M=300 
optimal (CPLEX), M=300 



2 3 
time, t (sec) 



(a) 



2500- 



~ .2 2000 

C ^ 
:= O 

« 

1 | 1500 



« E 



500 



-0- optimal (CPLEX) 
— H — DAA 






































































5 













1 00 1 50 200 250 300 350 400 450 500 
total number of clients, M 

(b) 

Figure 12. (a) Empirical CDF plots of Total CPU time, N = 10; (b) Average 
time to find optimal/suboptimal solution, JV = 10 



Assumption 1: There exist at least one feasible solution of 
problem (1261 , 

Assumption 2: For each j, the subset of 



{(y,.i.,syj../; ( !yji y, • y,\ 



(27) 



is compact. 

Assumption 3: For each j, given any vector y in convQ^), 
there exists y € 3^ such that hj(y) < h 3 (y), where 
hj : conv(y,) — !• 11*3 is the convexified version of b, on 
conv(3^ ) and the notation "<" here means the componentwise 
inequality. In particular, for all y G conv(D^) 



y fc e yj , Ek=i ah = i. « fc > °} • ( 28 ) 



Moreover, consider the dual problem of 



i.e.. 



maxmize d(v)= inf <^ J2 [fifa) + ^oiYj)} - 
subject to v > , 

with variables v — (y\, . . . , vq) £ MP, Then we have 



P* - D* < (Q + l)maxp, 



(29) 
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where P* denotes the optimal value of problem d26l i, D* 
denotes the optimal value of the dual problem ( |29l i, and pj 
is a nonnegative scalar such that 



Pj < sup fj( yj ) - inf fj( yj ) 



(30) 



Proof: We do not reproduce the proof here, but refer the 
interested reader to ||49l § 5.6.1, pp. 371-376] for a rigorous 
proof and to fl4l § 5.1.6] for an intuitive explanation. ■ 

Now we rely on Proposition \3\ above to prove Theorem Q] 
The key steps of the proof are: (a) We equivalently reformulate 
MILP i© in the form d26l i and (b) We show that the Assump- 
tions 1-3 of Proposition |5] hold for this equivalent problem. 

Let us start by considering the following problem which is 
closely related to MILP ©: 



minimize 
subject to 



^2jeM Pa x ij — Ej 

x i:j G {0, 1}, j G M,i£Af 



eM tj > 



(31) 



0<tj< i max 



j x rijj> 



jeM , 



where the variables are t = (t\,---,tM) and x 
{ x ij)ieN, jeM- The problem and 7^ are defined as 







ieAf, jeM t 
otherwise 



la 



1 jeM, ieAfj 

otherwise 



rij — argmirij e< Y,. and i max < 00 is an upper bound on 
tj, the optimal solution component of problem (|3T1 > that corre- 
sponds to tj. For example, we use t max = ma,x ieJ ^_j eM fifj, 
throughout this paper. We can easily show that problem (I3U 
is equivalent to original MILP © and the optimal value P* 
is equal to the optimal value p* of MILP ©, i.e., 

P* =p* . (32) 

We refer to problem (|3H as the modified MILP, which is in 
the form d26| >, where 

1. J = M and J = M, 

2. yj = (zj,tj) G IR y \ with Zj = (x tj ) ie ^/ and yj = 
N + l, 

fj(yi) = tj, 

yj = {{{ x ij)iEN',tj) I SigjV "lij X ii = X ij ^ 

{0,1}, i£Af_, tj G [O^ + jSn^j-]}, 

5 - h i(yj) = ((Aj^y - tj), • • •! (PNjXNj - tj)) g ir 13 , 

with Q = N, and 



3. 
4. 



6. b = 0. 

Let us now show that the Assumptions 1-3 of Proposition^ 
hold for the modified MILP PTT i. It is straightforward to see 
that Assumptions 1-2 hold. Checking whether Assumption 3 

holds is less trivial as we show next. 

Let y be any given vector in conv(3^ ). From the definition 
of hj(y) (see (l28li). we can express it as 



h,(y) = E «* h i(y*) 



(33) 



fc=i 



for some y fc G D^j and « fc such that y = EfcLi 



1 h h 

ay, 

Efc^i 1 ak = 1; afc — 0- Particularized to the modified 
MILP (O, we can write the set of equations ( !34at -( 34g I, where 
t~ = t max + (3 nj and y = (0, 0, . . . , 1, 0, . . . , 0,t™ ax ), 
which is feasible, i.e., y G 3^' and Assumption 3 holds. 
Note that the first three equalities (I34at - (l34cb follows from 
straightforward manipulations, inequality (I34dt follows from 
that tj < i max + PnjjX^.j for all k, inequality d34e| > follows 

from that J2k( akx ij) — ® f° r a ^ ^, anc ^ ^ ast two equali- 



ties (|34fT >-( [34g] > follow from straightforward manipulations. 

Finally, let us show that D* = d*, where D* is the 
dual optimal value of the associated dual problem [compare 
with <|59)] of the modified MILP {3B- We denote by P* elax 
the optimal value of the LP relaxation of (f3~TT >. Thus, we have 



D* 



P* 

relax 



Prcla 



d* , 



(35) 



where the first equality follows from a similar approach as 
described in the proof of Proposition [2] the second equality 
follows from that the LP relaxations of problem (13H and of 
problem © (see problem ( |23l ) have the same optimal value, 
and the last equality follows from Proposition [2] 

By using d32l . ( 1341 ). and that Assumptions 1-3 hold for the 
MILP (O together with Proposition \3\ we have 



p* -d* < (iV + l)maxp, 
jeM 

< (N + l)max(f 
jeM K 



(N + l) g + 



max Pi 
jeM J 



(36) 



where the first inequality follows from (129) , the second 
inequality follows from (l30i >. supt, = t max + (3 nj j, and 
inftj = 0, and the last equality follows from that i max = g 



and (3 nj j = Qj, which yields ( l25| i. 



hi(y) = E« fc (A^y-*i»- 

k 

fc 



ifirijj x nj 'j 

otx 



tj , . . . , j3j^jX k N j tj) 



, f3 Nj a k x k N] 



ah)) 



V fe fc fc 

> (^y0 - ^ ax , • • • , -* maX , ■ • • , Av,-0 - ^ ax ) 



(-r 

h,(y) 



-C ax ) 



(34a) 
(34b) 



A, - E(« fc ^), • • • E(" fe <i) - E(« fc ^), • • .Jnj E(a k x h Nj ) - E(« fe ^)J (34c) 

' ^E(a fe <i)-E(a fe (^^ ( 34d > 



(34e) 
(34f) 
(34g) 
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We note from (|36] l that the duality gap does not depend on 
M, i.e., the total number of clients. Moreover, we note that 
p* — > oo as m — > oo (see the objective function of original 
problem __)). Thus, we conclude that, the relative duality gap 
(p* — d*)/p* diminishes to zero as M —> oo. 
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